XCSF with tile coding in discontinuous action-value landscapes

نویسندگان

چکیده

برای دانلود رایگان متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

Adaptive Tile Coding for Value Function Approximation

Reinforcement learning problems are commonly tackled by estimating the optimal value function. In many real-world problems, learning this value function requires a function approximator, which maps states to values via a parameterized function. In practice, the success of function approximators depends on the ability of the human designer to select an appropriate representation for the value fu...

متن کامل

Adaptive Tile Coding for Value Function Approximation

متن کامل

On Continuous-Action Q-Learning via Tile Coding Function Approximation

Reinforcement learning (RL) is a powerful machine-learning methodology that has an established theoretical foundation and has proven effective in a variety of small, simulated domains. There has been considerable work on applying RL, a method originally conceived for discrete state-action spaces, to problems with continuous states. The extension of RL to allow continuous actions, on the other h...

متن کامل

结合Tile Coding的平均奖赏学习算法 An Average Learning Algorithm with Tile Coding

Average reward reinforcement learning is an important undiscounted optimality framework.It tries to learn policy by maximizing the long term average reward,and it is more appropriate for cyclical tasks than the discounted framework. Researchers have presented various average reward methods with lots of experiments to verify their validity.However,most of the work was based on discrete state spa...

متن کامل

Tile Coding Based on Hyperplane Tiles

In large and continuous state-action spaces reinforcement learning heavily relies on function approximation techniques. Tile coding is a well-known function approximator that has been successfully applied to many reinforcement learning tasks. In this paper we introduce the hyperplane tile coding, in which the usual tiles are replaced by parameterized hyperplanes that approximate the action-valu...

متن کامل

ذخیره در منابع من

ذخیره در منابع من قبلا به منابع من ذحیره شده

{@ msg_add @}

با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

ژورنال

عنوان ژورنال: Evolutionary Intelligence

سال: 2015

ISSN: 1864-5909,1864-5917

DOI: 10.1007/s12065-015-0129-7